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Abstract 

We reduce the best known approximation ratio for finding a weighted matching of 
a graph using a one-pass semi-streaming algorithm from 5.828 to 5.585. The semi- 
streaming model forbids random access to the input and restricts the memory to 
C(n-polylog n) bits. It was introduced by Muthukrishnan in 2003 and is appropriate 
when dealing with massive graphs. 
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1 Introduction 

Matching. Consider an undirected graph G — (V, E) without multi-edges or loops, 
where n and m are the number of the vertices and edges, respectively. Let furthermore 
w : E — > R + be a function that assigns a positive weight w(e) to each edge e. A matching 
in G is a subset of the edges such that no two edges in the matching have a vertex in 
common. With w{M) := X^eeM w i e ) being the weight of a matching M, the maximum 
weighted matching problem MWM is to find a matching in G that has maximum weight 
over all matchings in G. 

That problem is well studied and exact solutions in polynomial time are known, 
see |12) for an overview. The fastest algorithm is due to Gabowjl] and runs in time 
0(nm + n 2 logn). 

Approximation Algorithms. When processing massive graphs even the fastest exact 
algorithms computing an MWM are too time-consuming. Examples where weighted 
matchings in massive graphs must be calculated are the refinement of FEM nets [7\ and 
multilevel partitioning of graphs [5] . 

To deal with such graphs there has been effort to find algorithms that in a much shorter 
running time compute solutions that are not necessarily optimal but have some guaran- 
teed quality. Such algorithms are called approximation algorithms and their performance 
is given by an approximation ratio. A matching algorithm achieves a c-approximation 
ratio if for all graphs the algorithm finds a matching M such that w(M) > w ^ where 
M* is a matching of maximum weight. 

A 2-approximation algorithm computing a matching in time 0(m) was given by Preis 
[llj . The best known approximation ratio approachable in linear time is (3/2 + e) for 
an arbitrarily small but constant e. This ratio is obtained by an algorithm of Drake and 
Hougardy[Tj in time 0(m ■ i), an algorithm of Pettie and Sanders [TUj gets the same ratio 
slightly faster in time 0(m ■ log i). 
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Streaming Model. If we consider graphs being too big to run exact MWM algorithms 
on them, also an assumption of the classical RAM model is put in question: It is by no 
means the case that a massive graph can always be assumed as being stored completely 
within main memory, it is rather stored on disks or even tapes. Now seek times of 
read/write heads are dominating the running time. Thus for algorithms as the above 
ones that do not consider the peculiarities of external memory the running time totally 
get out of hand. 

To develop time-efficient algorithms working on these storage devices it is reasonable 
to assume the input of the algorithm (which is the output of the storage devices) to be 
a sequential stream. While tapes produce a stream as their natural output, disks reach 
much higher output rates when presenting their data sequentially in the order it is stored. 

Streaming algorithms are developed to deal with such large amounts of data arriving 
as a stream. In the classical data stream model, see e.g. [5], [S], the algorithm has to 
process the input stream using a working memory that is small compared to the length 
of the input. In particular the algorithm is unable to store the whole input and therefore 
has to make space-efficient summarizations of it according to the query to be answered. 
Semi-Streaming Model. To deal with graph problems in the streaming context 
Muthukrishnan|9 proposed the model of a semi-streaming algorithm: Random access 
to the input graph G is forbidden, on the contrary the algorithm gets the edges of G 
in arbitrary order as the input stream. The memory of the algorithm is restricted to 
0(n • polylogn) bits. That does not suffice to store all edges of G if G is sufficiently 
dense, i.e., m = uj(n ■ polylogn). A semi-streaming algorithm may read the input stream 
for a number of P passes. The parameter T denotes the per-edge processing time, that 
is, the time the algorithm needs to handle a single edge. 

Despite the heavy restrictions of the model there has been progress in developing 
semi-streaming algorithms solving graph problems. Feigenbaum et al.[2J, [3] present semi- 
streaming algorithms for testing fc-vertex and fc-edge connectivity of a graph, k being a 
constant. They point out how to find the connected components and a bipartition and 
how to calculate a minimum spanning tree of a weighted graph. Zelke[l3] showed how 
all these problems can be solved using only a constant per-edge processing time. 
Matching in the Semi-Streaming Model. There are approaches to find a weighted 
matching of a graph in the semi-streaming model. McGregor [B] presents an algorithm 
finding a (2 + ^-approximative solution with a number of passes P > 1 depending on e. 

However, for some real- world applications even a second pass over the input stream 
is unfeasible. If observed phenomena are not stored and must be processed immediately 
as they happen only a single pass over the input can occur. For the case of one-pass 
semi-streaming algorithms it is known, see [2J, that finding the optimal solution to the 
MWM problem is impossible in general graphs. A first one-pass semi-streaming algorithm 
approximating the MWM problem with a ratio of 6 presented in [2] was tweaked in [6 
to a ratio of 5.828, which was the best known ratio until recently. Both algorithms use 
only a per-edge processing time of 0(1). 

Our Contribution. In this paper we present a semi-streaming algorithm that runs in 
one pass over the input, has a constant per-edge processing time, and that approximates 
the MWM problem on general graphs with a ratio of 5.585. Therefore it surpasses the 
known semi-streaming algorithms computing a weighted matching in a single pass. In 
Section 2 we present our algorithm and its main ideas. While the proof of the approxi- 
mation ratio if found in Section 3, we conclude in Section 4. 

2 The Algorithm 

In a graph G — (V, E) let two edges be adjacent if they have a vertex in common. While 
M* denotes a matching of maximum weight in G let in the following M be the matching 
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Shadow Matching(G, k) 
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M := 
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while input stream is not empty 




Q 
O 


get next input edge 2/12/2 
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Let gij/i, 522/2 be the edges of M sharing a vertex with 2/12/2 
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aigi := shadow-edge (g%yi, gi) 
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a 2 g2 ■= shadow-edge((/22/2,32) 
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Let aici be the edge of M covering vertex a\ 




8 


Let a2C2 be the edge of M covering vertex 02 
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S ■= {2/12/2, 5i2/i, aigi, a\C\, (? 2 2/2, 0232, a2C 2 } 
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Find an augmenting set ACS that maximizes r(A) := w(A) — k ■ 


w(M(A)) 


11 


if r(A) > then 




12 


store each edge in M(A) as a shadow-edge of its adjacent edges 


in A 


13 


M := [M \ M{A)) U A 





Figure 1: The algorithm Shadow Matching 



of G that is currently under consideration by our algorithm. For a set of vertices W we 
call M(W) to be the set of edges in M covering a vertex in W. Correspondingly, for a set 
F of edges we denote by M(F) all edges in M that are adjacent to an edge in F. A set 
of edges in E \ M that are pairwise not adjacent we call an augmenting set. Throughout 
the whole paper k denotes a constant greater than 1. 

Our algorithm is given in Figure [T] Note at first that each edge in the algorithm 
is denoted by its endpoints, which is done for the sake of simpler considerations in the 
following on edges having common vertices. Every edge is well-defined by its endpoints 
since we assume the input graph G to contain neither multi-edges nor loops. 

The general idea of the algorithm is to keep a matching M of G at all times and to 
decide for each incoming edge 2/12/2 in the input stream if it is inserted into M. This is 
the case if the weight of 2/12/2 is big compared to the edges already in M sharing a vertex 
with 2/12/2 and that therefore must be removed from M to incorporate 2/1 2/2- 

This idea so far has already been utilized by one-pass semi-streaming algorithms of 
Feigenbaum et al.[5] and McGregor [5] seeking a matching in weighted graphs. However, 
our algorithm differs from the ones in [5j and [5j in fundamental points. 

First, if the algorithms in [2] and [B] remove an edge from the actual matching M 
this is irrevocable. Our new algorithm, by contrast, stores some edges that have been 




Figure 2: Example of an algorithm's step. Edges in M are shown in bold, shadow- 
edges appear in grey. 2/12/2 is the actual input edge shown dashed. The algorithm 
inserts the augmenting set A = {2/12/2, <Zi<7i} into M. Therefore the edges M(A) = 
{aici, 2/i2/i, 2/22/2} are removed from M, they become shadow-edges. 
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in M in the past but were removed from it. To potentially reinsert them into M the 
algorithm memorizes such edges under the name of shadow-edges. For an edge xy in 
M shadow- edge{xy 1 a), a s denotes an edge that is stored by the algorithm and 

shares the vertex a with xy. Every edge xy in M has at most two shadow-edges assigned 
to it, at most one shadow-edge is assigned to the endpoint x and at most one is assigned 
to y. 

A second main difference is the way of deciding if an edge e is inserted into M or not. 
In the algorithms of [5] and [5] this decision is based only on the edges in M adjacent 
to e. Our algorithm takes edges in M as well as shadow-edges in the vicinity of e into 
account to decide the insertion of e. 

Finally the algorithms of [2] and [S] are limited to the inclusion of the actual input 
edge into M. By reintegrating shadow-edges our algorithm can insert up to three edges 
into M within a single step. 

Let us take a closer look at the algorithm. As an example of a step of the algorithm, 
Figure [His given. But note that this picture shows only one possible configuration of the 
set S. Since non-matching edges in S may be adjacent, S may look different. 

After reading the actual input edge 2/12/2 the algorithm tags all memorized edges in 
the vicinity of 2/12/2 ■ This is done in lines 4-8. If an edge is not present the corresponding 
tag denotes the null-edge, that is, the empty set of weight zero. Thus if for example the 
endpoint 2/2 of the input edge 2/12/2 is not covered by an edge in M, the identifier (722/2 
denotes a null-edge, as well as its shadow-edge 02(72 and the edge 0202- All edges tagged 
so far are taken into consideration in the remaining part of the loop, they are subsumed 
to the set S in line 9. 

In line 10 all augmenting sets of S are examined. Among these sets the algorithm 
selects A that maximizes r(A). If r(A) > the edges of A are taken into M and the 
edges in M sharing a vertex with edges in A are removed from M. We say A is inserted 
into M, this is done in line 13. 

If an augmenting set A is inserted into M this is always accompanied by storing the 
removed edges M(A) as shadow-edges of edges in A in line 12. More precisely, every 
edge e in M(A) is assigned as a shadow-edge to every edge in A that shares a vertex 
with e. If, as in the example given in Figure A = {2/12/2, <ZiSi}> the edge giyi that is 
adjacent to both edges in A is memorized under the name shadow-edge(2/i2/2, 2/1) as well 
as under the name shadow-edge(ai<7i, g±). a\C\ is stored as shadow-edge(ai(/i, ai), (722/2 
as shadow-edge(2/ij/2, 2/2)- After inserting A, 02(72 is not memorized as a shadow-edge 
assigned to (722/2 since (722/2 is not an edge in M afterwards. That is indicated in Figure [2] 
by the disappearance of 02(72- However, if 0232 was memorized as a shadow-edge of 0,202 
before, this will also be the case after inserting A. 

It is important to note that there is never an edge in M which is a shadow-edge at 
the same time: Edges only become shadow-edges if they are removed from M. An edge 
which is inserted into M is no shadow-edge anymore, since there is no edge in M it could 
be assigned to as a shadow-edge. 

It is easy to see that our algorithm computes a valid matching of the input graph G. 

Corollary 1 Throughout the algorithm Shadow Matching (G, k), M is a matching of G. 

Proof. This is true at the beginning of the algorithm since M = 0. Whenever the 
algorithm modifies M in line 13 it inserts the edges in A, they are pairwise not adjacent, 
and removes all edges M{A) that are adjacent to an edge in A. Thus M never includes 
two adjacent edges. □ 
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Our algorithm may remind of algorithms in [JJ and [10] approximating a maximum 
weighted matching in the RAM model. Starting from some actual matching M in a 
graph G these algorithms look for short augmentations, that is, connected subgraphs of 
G having constant size in which edges in M and E\M can be exchanged to increase the 
weight of the actual matching. 

From this point of view our algorithm may suggest itself as it is reasonable to expect 
the notion of short augmentations to be profitable in the semi-streaming model as well. 
However, we are unable to use even the basic ideas of proving the approximation ratio in 
P] and [10] . As well as the algorithms the proof concept relies on random access to the 
whole graph, a potential we cannot count on in the semi-streaming model. 

Certainly our algorithm can be considered as a natural extension of the semi-streaming 
algorithms in [2] and [3] seeking a weighted matching. But the abilities of our algorithm 
go beyond the insertion of a single edge to the actual matching, the step to which the 
algorithms in [2] and [6j are limited to. Therefore we have to substantially enhance the 
proof techniques used therein to attest an improved approximation ratio of our algorithm. 
This is done in the next section. 

3 Approximation Ratio 

Consider an augmenting set A which covers the vertices B and let fc > 1 be some constant. 
We call fA,k '■ V — ► {x G R | < x < 1} an allocation function for A if fA,k( v ) = for all 
v G V \ B and additionally the following holds: 

. V ab G A : f A<k {a) ■ w(M(a)) + f A>k (b) ■ w(M(b)) < « 

. V cd G M(A) : f AM {c) + f Atk {d) > 1 

If there exists such an allocation function f Ajk for an augmenting set A we call A to be 
locally k-exceeding. The intuition here is as follows: If for an augmenting set A we have 
w(A) > k ■ w(M(A)) we can distribute the weight of the edges in M(A) to the edges of 
A in such a way that every edge ab in A gets weight of at most ^ distributed to it. If 
A satisfies the stronger condition of being locally fc-exceeding such a weight distribution 
can also be done with the additional property that the weight of an edge cd in M(A) is 
distributed only to edges in A that are adjacent to cd. 

Lemma 2 Every augmenting set A that is inserted into M by the algorithm Shadow 
Matching(G, k) is locally k-exceeding. 

Proof. Since A C {2/12/2, ai5i, fi2<?2} and r(A) > 0, 1 < \A\ < 3. If A consists of only 
one edge, say 2/12/2, we have for the sum of the weights of the adjacent edges w(gxyx) + 
w{g2V2) < w ( y ^ y2 ^ because of the satisfied condition in line 11. In that case the allocation 
function is f A; k(yi) = fA,k(y2) = 1 and A is locally fc-exceeding. 

Let A consist of two edges, say 2/12/2 and a%gi. Since every subset of A is an augmenting 
set as well which is not taken by the algorithm we know that r({j/i2/2, QiSi}) > ^({2/12/2}) 
and therefore 

w(Viy2) + w(axgi) - k(w(axa) + w(g 1 yx) + ^(522/2)) > w{yiy 2 ) - k(w(gxyx) + w(g 2 y2)) 

Thus w(axgx) > k ■ w(axcx) and because ^({2/12/2, > r ({ a i.9i}) we can deduce 

similarly 10(2/12/2) > k ■ ^((722/2)- Hence for the allocation function we can set f A ,k{a-i) = 
fA,kijj2) = 1- Since r(A) > we can find appropriate values for f At k(gx) and f A , k (yx), 
therefore A is locally fc-exceeding. 

For other configurations of A it can be exploited correspondingly that r(A) > r(A') 
for all subsets A' of A. Therefore it can be shown similarly that an allocation function 
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exists and A is locally fc-exceeding. 



□ 



Because of Corollary [T] we can take the final M of the algorithm as a valid solution for 
the weighted matching problem on the input graph G. It is immediate that the constant 
k is crucial for the weight of the solution we get and therefore determines the ratio up to 
which the algorithm approximates an optimal matching. The main part of the paper is 
to prove the following theorem which we just state here and which we prove later. 

Theorem 3 The algorithm Shadow Matching(G, k), k > 1, constructs a weighted match- 
ing M of G. M is at most a factor of 



smaller than the weight of an optimal weighted matching of G. 

We call Gi the subgraph of G consisting of the first i input edges, Mi denotes the M of 
the algorithm after completing the while-loop for the ith input edge. An edge xy prevents 
an edge ah if ab is the ith input edge and xy £ Mi shares an endpoint with ab, thus ab 
is not taken into M by the algorithm. Note that an edge might be prevented by one or 
two edges. An edge xy replaces an edge cd if xy is the ith input edge, xy and cd share 
a vertex, cd £ Mj_i, and xy £ Mi. Therefore cd is not in M afterwards. An edge can 
replace up to two edges and can be replaced by up to two edges. 

Consider an optimal solution M* = {01,02, . . .} for the MWM problem of G, M* := 
M* n Gi. The edges oi,o 2 , ... in M* we call optimal edges. If w(M, t ) < w(M*), some 
edges of M* must be missing in Mi. There are two possible reasons for the absence of 
an edge o\ £ M* in Mj. First, there are edges in Mj, j < i, which prevented o;. Second, 
o; G Mj, j < i, is replaced by one or more edges and never reinserted. 

In any case we can make edges in {J h<i Mh responsible for missing edges of M* in Mj. 
We charge the weight of an optimal edge 01 to the edges in \J h<i Mh that are responsible 
for the prevention or the removal of o; . If such a charged edge in M is replaced by other 
edges its charge is transferred to the replacing edges such that no charge is lost. After 
all we can sum up the charges of all edges in the final M m to get w(M* \ M m ). 

To bound w(M* \ Mi) as a multiple c of w(Mi) if suffices to show that each edge 
xy £ Mi carries a charge of at most c • w(xy). This technique has been carried out 
by Feigenbaum et al.[2] and McGregor [6] to estimate the approximation ratios of their 
semi-streaming algorithms calculating a weighted matching. 

We follow the same general idea but need a more sophisticated approach of managing 
the charge. This is due to two reasons. First, the algorithms of [5] and [S] are limited 
to a simple replacement step which substitutes one or two edges by a single edge e. 
That makes the charge transfer easy to follow since the charges of the substituted edges 
are transferred completely to the single edge e. Our algorithm, by contrast, is able 
to substitute several edges by groups of edges. The charge to be transferred must be 
distributed carefully to the replacing edges. 

Second, in the algorithms of [2] and [H] the decision whether to insert an input edge 
into M is determined only by the edges in M adjacent to the input edge. If an optimal 
edge o is not taken into M the charge can simply be assigned to the at most two edges 
already in M that are adjacent to o. In our algorithm not only the edges in M that are 
adjacent to o specify if o is taken into M. In fact, several shadow-edges and other edges 
in M in the environment of o may codetermine if o is inserted into M. These ambient 
edges must be taken into account if charge has to be distributed for preventing o. 

For our more sophisticated technique of managing the charges we think of every edge 
xy £ M as being equipped with two values, namely charge of optimal edge coe{xy,x) 



k + 



fc-1 



k 



+ 



k 3 - k + 1 
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and coe(xy,y), one for every endpoint of xy. coe(xy,x) is the charge that the edge in 
M* which is covering the vertex x is charging to xy. 

If an edge is removed from M its charges are transfered to the one or two replacing 
edges. Therefore in addition to its coe(xy,x) and coe(xy,y) every edge xy £ M is 
equipped with a third value aggregated charge ac(xy) which is the sum of the charges 
that xy takes over from edges replaced by xy. We define T(xy) :— coe(xy, x)+coe(xy, y) + 
ac(xy) as the sum of the charges of the edge xy. 

During the proof of the following lemma we will explicitly show how the weights of 
edges in M* \M t can be charged to the edges in Mi and how these charges are transferred 
to replacing edges such that particular properties hold. 

Lemma 4 Let Mi be the solution found by the algorithm Shadow Matching(G,k), k > 1, 
after reading Gi for 1 < i < m. To every edge xy in Mi we can assign three values 
coe(xy,x), coe(xy,y) and ac{xy), with T(xy) being their sum, such that: 

a) E T(xy)>w{M*\M l ) 

xyeMi 

b) V xy £ Mi: coe(xy, x) < k ■ w(xy) and coe(xy, y) < k ■ w(xy) 

c) V xy £ Mi: ac(xy) < ^ • w(xy) 

d) Vxy£ M i: T(xy) < (k + ^ + ^S^) ■ w(xy) 

Proof. Let yiy2 be the actual input edge. We first take a look at the different cases 
that can occur if 2/12/2 is not taken into M by the algorithm. We postpone the cases 
in which a\g\ and a 2 g 2 are adjacent, thus until further notice the set {0131,0252} is an 
augmenting set. If 2/12/2 as the actual input edge is not taken into M this is due to two 
possible reasons. First, no augmenting set is inserted into M. Second, the augmenting 
set that is inserted does not contain 2/1 2/2- If the first case occurs at least one of the 
following conditions is satisfied: 

i) w(ym) < min{fc • (w(#i?/i) + 10(522/2)), 

k ■ (w(c7ij/i) + w{a- l c- l )) - tu(ffli5i) + k ■ w(g 2 y 2 ), 
k ■ {w(g 2 y 2 ) + w(a 2 c 2 )) - w(a 2 g 2 ) + k ■ w{g\yi), 

k ■ (w(3ij/i) + w{a 1 c 1 )) - tu(ai0i) + k ■ (w(g 2 y 2 ) + w(a 2 c 2 j) - w(a 2 g 2 )} 

ii) w{y 1 y 2 ) < min{fc • (w(gi2/i) + w(g 2 y 2 )), 

k ■ (w(g 2 y 2 ) + w(a 2 c 2 )) - w(a 2 g 2 ) + k ■ w{g\y{)} and y 2 = 01 

iii) w{y 1 y 2 ) < min{fc • {w{g x y 1 ) + w(g 2 y 2 )), 

k ■ (w(giyi) + lo(aici)) - w(aigi) + k ■ w(g 2 y 2 )} and 2/1 = a 2 

iv) w{yiy 2 ) < k ■ (w(giyi) + w(g 2 y 2 )), y\ = a 2 , and y 2 = a x 

Condition i) captures the situation where {2/12/2, aigi, a 2 g 2 } is an augmenting set but 
neither this set nor one of its subsets satisfies the condition in line 11 of the algorithm. 
Conditions ii) — iv) include the cases in which 2/12/2 is adjacent to oi(?i, a 2 g 2 , or both. 

As mentioned above the algorithm possibly inserts an augmenting set A into M that 
does not contain 2/12/2- Exploiting the fact that r{A) > r(A') for all other augmenting 
sets A' we get that at least one of the following conditions is satisfied in this case. 

v) w(yiy 2 ) < min{fc • w(g 2 y 2 ), k ■ (w(g 2 y 2 ) + w(a 2 c 2 )) - w(a 2 g 2 )} and A = {aigi} 

vi) w(y 1 y 2 ) < min{fc • w{g\y{), k ■ (w((/i?/i) + tu(aiCi)) - w(a 1 g 1 )} and A = {a 2 g 2 } 

vii) w(y 1 y 2 ) < k ■ w(g 2 y 2 ), 2/1 = a 2 , and A = {axgx} 
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viii) w{yiy 2 ) < k ■ w(gxyi), y 2 = cti, and A = {a 2 g 2 } 

ix) w{yiy 2 ) < w{a\g\), y 2 = »i, and {aigi} C A 

x) w{yiy 2 ) < w(a 2 g 2 ), yi = a 2 , and {a 2 g 2 } C A 

xi) w(yiy 2 ) < u?(aipi) + w(a 2 g 2 ), y 2 = a±, yi = a 2 , and A = {aigi, a 2 g 2 } 

If the ith input edge yiy 2 E M* is not taken into Mj we have to charge the edges in 
Mi that prevent the optimal edge y\y 2 . In the cases i) — viii) the edges giyi and/or 
<?2?/2 prevent yiy 2 , in the cases ix) — xi) a\g\ and/or a 2 g 2 prevent y\y 2 - To charge the 
preventing edges in Mj we split w{y\y 2 ) into two partial weights and charge one partial 
weight to the edge in M, covering y\ and one to the edge in Mj covering y 2 . In any of the 
above cases w(y\y 2 ) can be split into two partial weights in such a way that the following 
generalization holds. 

Let ab € Mi share the vertex a with the ith input edge o S M* . Let be be the shadow- 
edge(a&, b), that is, the shadow-edge assigned to the vertex of ab that is not shared by o. 
Let cd be the edge in Mi that covers c. w(o) can be split into two partial weights such 
that for the partial weight p that ab has to take as a charge for preventing o at least one 
of the following conditions is satisfied: 

(A) p < k ■ w(ab) < k ■ (w(ab) + w(cd)) — w(bc) 

(B) p < k ■ (w(ab) + w(cd)) — w(bc) < k ■ w(ab) 

(C) p < k ■ w(ab) and ab, input edge o and shadow-edge be form a triangle. 

We start to prove the lemma by induction over the edges inserted into M . More precisely 
we suppose that the edge yiy 2 as the ith input edge is inserted into Mj_i and that before 
this insertion all properties of the lemma are satisfied. 

We have to consider two things: First, we have to point out how the charges of the 
edges in Mj_i that yiy 2 replaces are carried over to y\y 2 to preserve the properties of 
the lemma. Second we have to regard the at most two optimal edges that possibly come 
after y\y 2 and share a vertex with y\y 2 . If y\y 2 prevents one or both of these edges we 
have to show how y\y 2 is charged by them without violating the lemma. 

For the initial step of our induction note that the properties of the lemma hold for the 
first input edge. 

For the inductive step let y\y 2 as the ith input edge be taken into Mj. Thus y\y 2 is 
contained in the augmenting set A that is inserted into M. Because of Lemma [5] A is 
locally fc-exceeding, hence there exists an allocation function fA,k- 

Let in the following x 6 {1, 2}. y±y 2 takes over charges from g x y x , the edges it replaces, 
according to the allocation function fA,k- More precisely it takes over a /^(y^-fraction 
of the charges of g x y x . In fact, y\y 2 builds its ac as follows: ac(\j\y 2 ) — [coe{g\yi, g\) + 
ac(giyi)) ■ fA,k{y\) + {coe(g 2 y 2l g 2 ) + ac{g 2 y 2 )) ■ /aAv^)- By the induction hypothesis 
coe(g x y Xl g x ) < k ■ w(g x y x ) and ac(g x y x ) < ^ • w(g x y x ). Due to the definition of an 

allocation function fA,k{yi) ' w{9Wi) + fA,k(ys) • w(g 2 y 2 ) < HMM. Thus ac(yiy 2 ) < 
tPT • w(yiy 2 ) satisfying property c). 

Furthermore y\y 2 takes over charge from coe(g x y x ,y x ) to its own coe(yiy 2 ,y x ), again 
a /A,fe(j/x)-fraction of it. If g x y x is in M* , coe(g x y x ,y x ) = and y\y 2 instead takes over 
a / J 4,fc(y a; )-fraction of w(g x y x ) as its coe(yiy 2 ,y x ) for replacing the optimal edge g x y x . 

Note that whenever fA,k(y x ) < 1, yW2 does not take over all the charge of g x y x . 
However, the definition of the allocation function makes sure that fA,k(g x ) > 1 — fA,k(Ux) 
and that another edge in A covering g x takes over the remaining charge of g x y x . That 
way no charge can get lost and property a) holds. 
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Let us check the validity of property b). Right after 2/12/2 was inserted into M and 
took over the charges as described from g x y x it holds that coe(y\y2, y x ) < w (2/iJ/2)- That 
does not suffice to show validity of property b). In fact, there might be an optimal edge 
o x y x coming after 2/12/2 in the input stream covering y x . In that case coe(yiy 2 ,y x ) = 
up to this moment, since there cannot be another optimal edge besides o x y x covering y x . 
If o x y x is not inserted into M, that is, 2/12/2 prevents o x y x , 2/12/2 must be charged. By the 
considerations above we know about the charges that an edge in M has to take because 
of optimal edges prevented by it. In all three possibilities (A)-(C) the charge 2/12/2 has to 
include into coe(yiy2,y x ) for preventing o x y x is at most k • w(yiy 2 ), satisfying property 
b). 

It remains to show that property d) holds which bounds the sum of all charges of 
2/i2/2- The situation is as follows: 2/12/2 is in M and we call the shadow-edge(2/ij/2, 2/1) 
c/12/1, the shadow-edge(2/i2/2,2/2) 522/2- Note again that 2/12/2 took over only a f A ,k(yx)- 
fraction of the charges from g x y x . Directly after 2/12/2 was inserted into M and took over 
the charges from the replaced edges as described property d) holds. We have to consider 
optimal edges o x y x that appear after 2/12/2 in the input stream, are prevented by 2/12/2 and 
therefore cause charge p x at coe (2/1 2/2, 2/aO- 

As described 00(2/12/2) is composed of four values, namely fractions of ac(g x y x ) and 
coe(g x y y ,g x ). The value of the fraction of ac(g x y x ) that is part of ac(yiy 2 ) we call 
ac(g x y x ) rx 00(2/12/2), correspondingly we have coe(g x y x ,g x ) rx ac{yiy 2 ). Using that we 
can separate T(?/i2/2) into two halves as follows 

7X2/12/2) = (coe(yiy2, 2/2) + ac{g\y\) rx ac{yiy 2 ) + coe{g 1 y 1 ,g 1 ) rx 00(2/12/2)) + 

{coe (yi 2/2, 2/1) + ac(g 2 y2) rx ac{yiy 2 ) + coe (g 2 2/2, 9i) rx 00(2/12/2)) 

Let us call the upper half HI and the lower one H2. We will estimate H2 in the following 
according to the three possible cases for p\ and show that 

H2 < (k + -j-^-j + w{g 2 y 2 ) ■ fA,k(V2) + k ■ w{ yi y 2 ) (1) 

We will see later that it suffices to show that if neither H2 violates (1) nor HI violates 
a corresponding inequality, property d) holds for 2/12/2 • 

Charge p\ coming from 0\y\ satisfies (A) 

Let 2/2^2 be an edge in M covering g 2 ■ We can bound pi because of property (A) 

Pi < k ■ w{y±y 2 ) < k ■ {w(yiy 2 ) + w(g 2 z 2 )) - w(g 2 y2) (2) 

We call the shadow-edge 2/22/2 of 2/12/2 overloaded if we have 006(2/22/2,2/2) rx 00(2/12/2) > 
^(2/22/2) • fA.k{y2)- For a shadow-edge uv we say that uv fingers v if uv covers v and v 
is not the vertex that uv shares with the edge in M it is assigned to. For example the 
shadow-edge 2/22/2, which is assigned to 2/12/2, fingers 2/2 but not 2/2- A shadow-edge uv is 
prepared if for the edge uw in M that uv is assigned to coe(uw, w) — 0. So in the present 
example 2/22/2 is prepared if coe {y\ 2/2, 2/1) = 0. 

If Pi < k ■ w(j/i2/2) — fA,k{y2) • w(g2y 2 ) or if 2/22/2 is not overloaded, we can simply add 
pi to coe(2/i2/2, 2/i) and H2 satisfies (1). Otherwise we do a charge transfer as follows: We 
reduce coe (g 2 2/2, 92) rx 00(2/12/2) tor := max{coe(c722/2, 32) rx ac(yiy 2 ) — (k—l)-w(g2Z 2 ), 0} 
and add a value of coe (2/2 2/2, 2/2) rx- 00(2/12/2) - f to coe (2/222, 2/2), thus no charge is lost. 

It is important to see that this increasing of coe{g2Z2, 2/2) does not violate the prop- 
erties of the lemma for 2/2^2^ We know that coe(2/2^2, ^2) < k ■ ^(2/222) and 00(2/2^2) < 
• w{g2Z2)- If before the charge transfer 006(2/2^2,2/2) = 0, after the transfer T{g2Z2) 
cannot exceed (k + + k ' w {d2Z2)- 
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For the other case, i.e., that coe(g2Z2, 32) > before the charge transfer we need a 
few considerations. In fact, we will show that for every vertex v at every moment of the 
algorithm at most one shadow-edge fingers v, is overloaded, and prepared at the same 
time: 

Assume that uv is the first shadow-edge created by the algorithm that is fingering v 
and that is overloaded and prepared. This can only be the case if uv in M gets replaced 
by uw and possibly vs. uv as a shadow-edge of uw is now fingering v and it is overloaded 
and prepared. Right after the replacement coe(vs,v) < w(vs). As long as no charge of 
coe(uv, v) r\ ac(uw) is transferred to an edge in M covering v, for every edge vq in M 
coe(vq, v) < w(vq). Such an edge vq cannot be turned into a shadow-edge fingering v and 
being overloaded. A second overloaded shadow-edge fingering v can only be created by 
replacing an edge vr with coe(vr, v) > w(vr), that can only occur if uw transfers charge 
to vr. However, uw only transfers charge to vr if it prevents an optimal edge. After 
that coe(uw, w) > and uv is not prepared anymore. This shows that a prepared and 
overloaded shadow-edge fingering v can only be created if the at most one previously 
prepared and overloaded shadow-edge fingering v lost its status as being prepared. 

Now we can come back to the case coe(g2Z2, 52) > 0. We can assume that 32^2 as part 
of the augmenting set A' replaced the edges ^2^2 and t 2 Z2- 92Z2 took over a fA',k{92)- 
fraction of the charges from g?2</2- Since coe(d 2 g 2 , 52) < k-w(d 2 g 2 ) before the replacement 
of d 2 g 2 , we have coe(g 2 z 2 , 32) < fA',k{92) • k ■ w(d 2 g 2 ) after the replacement. By the 
definition of an allocation function it follows coe(g2Z2, 52) < w(g 2 z 2 )—fA',k{z 2 )-k-w(t 2 z 2 ). 
After our charge transfer of weight at most (k — 1) • w(g 2 Z2) from coe(g 2 y 2 , 32) ^ o,c{y\y 2 ) 
to coe (g 2 z 2l g 2 ), it holds that coe(g 2 z 2l 92) < k ■ (w(g 2 z 2 ) - f A' ,k{z 2 ) • w(t 2 z 2 )). Therefore 
the charges of g 2 z 2 satisfy an inequality corresponding to (1), thus property d) cannot 
be violated for g 2 z 2 . 

Now the above considerations are important: We know that no shadow-edge besides 
g 2 y 2 that is fingering g 2 is prepared and overloaded. Thus no further charge transfer to 
coe(g 2 z 2 , g 2 ) can occur violating the properties of the lemma for g 2 z 2 . 

After transferring a part of coe(g 2 y 2 , g 2 ) r\ ac(yiy 2 ) as described we have coe(g 2 y 2 , g 2 ) 
ac(y 1 y 2 ) < max{ k ■ fA,k{Vi) -w(g 2 y 2 ) - (k- 1) • w(g 2 z 2 ), 0}. We add pi to coe(yiy 2 ,y 1 ) 
and can evaluate HI: We have coe(yiy 2l yi) = pi < k ■ w(yiy 2 ) because of (2) and 
ac(g 2 y 2 ) r\ ac(yiy 2 ) < /a,*; (2/2) ■ w{g 2 y 2 ) ■ j^— by the induction hypothesis. Since 
w{g 2 z 2 ) > W ^ V2 ^ because of (2) we can estimate H 2 as being bounded as in (1). 

Charge pi coming from 0\y\ satisfies (B) 

This case is very similar to the previous one with the only difference that w(g 2 z 2 ) < 
W ^ 92V2 ' > and we use p\ < k ■ (w(yiy 2 ) + w(g 2 z 2 )) — w(g 2 y 2 ). All other considerations 
remain the same and that results in the very same estimation for H2. 

Charge p\ coming from 0\y\ satisfies (C) 

In this case o\ = g 2 since the input edge oij/i, the edge y\y 2 £ M and the shadow-edge 
32 J/2 form a triangle. Since 32J/1 is an optimal edge, before its arrival coe (32 J/2, 92) ^> 
ac(yiy 2 ) = 0. So y\y 2 can take a charge of p\ < k ■ w{y\y 2 ) as its coe(j/ij/2, J/i) and H2 
satisfies (1). 

We can handle the charge p\ in every possible case such that H2 satisfies (1). With 
a symmetric argumentation we can show that HI satisfies a corresponding inequality. 
Using that f A ,k{yi) ■ w{g 1 y 1 ) + f A ,k{V2) ■ ^(322/2) < t " (y fc iy2) we get 

T( ym ) = Hl + H2<fk+^- [ + k3 ^ 2 + 1 ) ■ 
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It remains to consider the postponed cases in which 3132 is not inserted into M by 
the algorithm and a\g\, 0.232 have a vertex in common, hence cannot be taken into M 
simultaneously. 

If a\ =92, the augmenting set A = {yiy2, ai3i} and M(A) build a cycle on 4 vertices. 
If on the one hand 02 31 in this situation and no augmenting set is inserted into M, 
case i) is satisfied, if 0232 is inserted into M case vi) is met. If on the other hand 02 = yi 
the cases Hi) or x) are applicable. 

If cti = a.2 and 0232 is taken into M, after that 0232 is the edge in M that covers 
ai, thus 0,232 can be qualified as the edge a\C\ in our notation. Using that case vi) is 
applicable. 

The last possibility is the one in which a\ = 02 and no augmenting set is inserted 
into M. Assume now that this is the case, thus the situation is as follows: 3131 and 
g 2 y2 are in M, a\g\ = shadow-edge(gij/i, g x ) and a 2 .g 2 = shadow-edge(g 2 ?/2, 32)- giVi 
took over a /^/^(g^-fraction of the charges from a\g\ when replacing it, 322/2 took over 
a / J 4",fe(32)-fraction of the charges from 0232- Since a\ = 0,2 it is also c\ =03. 

Let fA',k{gi) ■w(a>igi) > fA",k{g2) • ^(0232)- It suffices to consider y\y% as an optimal 
edge since otherwise no charge must be assigned if 3132 is prevented and the properties 
of the lemma hold further on. 

Prior the arrival of 3132, 000(3131, 31) = 000(3232, 2/2) = 0, thus a\g\ and 02172 are both 
prepared and fingering ai . If coe(a 2 g2, a 2 ) rx 00(322/2) = Ja" ,k(g2)-w(a 2 g2)+X for X > 0, 
&2.02 is overloaded, thus coe{a\g\,a\) rx 00(3131) < fA'.k(gi) • w{o,i9i) since oi<?i cannot 
be overloaded as well. X cannot be greater than {k — 1) ■ fA",k{g2) • ^(0232), therefore 
we can transfer a charge of weight X from 006(0232,02) rv 00(322/2) to coe{a\g\,a{) rx 
ac(giyi), aigi might get overloaded, 0232 is not overloaded anymore. 

After this transfer of charge, or if no transfer was necessary because X < 0, we have 
coe(o232,a 2 ) rx 00(3232) < fA",k(92) • w(a 2 3 2 ). Thus 000(3232,32) can take a charge of 
k • 1^(3232) without violating the properties of the lemma since in that case 000(3232, 32), 
000(0232, 02) rx 00(3232) and 00(0232) rx 00(3232) still satisfy an inequality corresponding 
to (1). If no augmenting set is inserted into M, 1^(3132) < min{fc- (10(312/1) + 10(322/2)), k- 
(10(312/1) + tu(oiCi)) — 10(0131) + k ■ 10(3232)}- Therefore the partial weight of 3132 that 
3131 has to take as charge for preventing 3132 satisfies the properties (A) or (B). 

We showed that the properties a)-d) of the lemma hold when 3132 replaces and prevents 
edges. In the very same way the validity of the properties can be shown for the edges 
ai3i and/or 0232 that are possibly taken into M at the same time as 3132- □ 

Using Lemma |4] we can prove our main theorem. 

Proof of Theorem^ Let M be the final M m . w(M*) = w(M* D M) + w(M* \ M). 
Because for an edge xy E M* n M we have coe(xy, x) = coe(xy, y) = 0, we can write 

k 

w(M* \ M) < 22 - — - • w(xy) + T ( uv ) 

xy£M*nM uv£M\M* 

That results in w(M*) < [k + + • w(M). □ 

The term describing the approximation ratio of our algorithm reaches its minimum for 
k being around 1.717, that yields a ratio of 5.585. It is easy to see that the algorithm 
does not exceed the space restrictions of the semi-streaming model: It needs to memorize 
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the edges of M, for each of those at most two shadow-edges, thus it suffices to store a 
linear number of edges. The time required to handle a single input edge is determined 
by the size of S. Since S is of constant size, a single run of the while loop, including 
the enumeration and comparison of all possible augmenting sets of S, can be done in 
constant time. Therefore the algorithm needs a per-edge processing time of 0(1). As 
well as the single pass over the input this is optimal. 



4 Conclusion 

We presented a semi-streaming algorithm calculating a weighted matching in a graph 
G. Our algorithm achieves an approximation ratio of 5.585 and therefore surpasses all 
previous algorithms for the maximum weighted matching problem in the semi-streaming 
model. In addition to the edges of an actual matching M the algorithm memorizes some 
more edges of G, the so called shadow-edges. For each input edge e, the subgraph S 
made up of e and of shadow-edges and edges of M in the vicinity of e is examined. If a 
certain gain in the weight of M can be made, matching and non-matching edges in S are 
exchanged. 

The subgraph S investigated by our algorithm for each input edge consists of at most 
seven edges. It is reasonable to assume that by examining bigger subgraphs the approxi- 
mation ratio can be enhanced further. Therefore we believe that extending our approach 
will lead to improved semi-streaming algorithms computing a weighted matching. 



References 

D. E. Drake Vinkemeier, S. Hougardy. A linear-time approximation algorithm for weighted 
matchings in graphs. ACM Transactions on Algorithms (TALG) 1(1), 107-122, 2005. 
J. Feigenbaum, S. Kannan, A. McGregor, S. Suri, and J. Zhang. On graph problems in a 
semi-streaming model. ICALP 2004, In: LNCS 3142, 531-543, 2004. 

J. Feigenbaum, S. Kannan, A. McGregor, S. Suri, and J. Zhang. Graph Distances in the 
Streaming Model: The Value of Space. In: Proc. ACM-SIAM SODA (2005): 745-754, 2005. 
H.N. Gabow. Data structures for weighted matchings and nearest common ancestors with 
linking. In: Proc. ACM-SIAM SODA (1990), 434-443, 1990. 

M. R. Henzinger, P. Raghavan, and S. Rajagopalan. Computing on data streams. In: Exter- 
nal Memory Algorithms, Dimacs Series In Discrete Mathematics And Theoretical Computer 
Science, 50:107-118, 1999 

A. McGregor. Finding Graph Matchings in Data Streams. APPROX and RANDOM 2005, 
In: LNCS 3624, 170-181, 2005. 

R.H. Mohring, M. Muller-Hannemann. Complexity and modeling aspects of mesh refine- 
ment into quadrilaterals. Algorithmica 26, 148-171, 2000. 

B. Monien, R. Preis, and R. Diekmann. Quality matching and local improvement for mul- 
tilevel graph-partitioning. Paral. Comput. 26, 1609-1634, 2000. 

S. Muthukrishnan. Data streams: Algorithms and applications. 2003. Available at 



http:/ /athos. rutgers.edu/~muthu/stream-l-l.ps 



S. Pettie, P. Sanders. A simpler linear time 2/3-e approximation for maximum weight 
matching. Information Processing Letters 91(6), 271-276, 2004. 

R. Preis. Linear time 1/2-approximation algorithm for maximum weighted matching in gen- 
eral graphs. 16th Annual Symposium on Theoretical Aspects of Computer Science STACS 
1999, In: LNCS 1563, 259-269, 1999. 

A. Schrijver. Combinatorial optimization: polyhedra and efficiency. Springer, Heidelberg, 
2003. 

M. Zelke. Optimal Per-Edge Processing Times in the Semi-Streaming Model. Information 
Processing Letters 104(3), 106-112, 2007. 



12 



